Multiple drug resistance gene atrD of Aspergillus nidulans

ABSTRACT

The invention provides isolated nucleic acid compounds encoding a multiple drug resistance protein of Aspergillus nidulans. Vectors and transformed host cells comprising the multiple drug resistance-encoding DNA of Aspergillus nidulans atrD are also provided. The invention further provides assays which utilize these transformed host cells.

TECHNICAL FIELD OF THE INVENTION

This invention relates to recombinant DNA technology. In particular, the invention concerns the cloning of nucleic acid encoding a multiple drug resistance protein of Aspergillus nidulans.

BACKGROUND OF THE INVENTION

Multiple drug resistance (MDR) mediated by the human mdr-1 gene product was initially recognized during the course of developing regimens for cancer chemotherapy (Fojo et al., 1987, Journal of Clinical Oncology 5:1922-1927). A multiple drug resistant cancer cell line exhibits resistance to high levels of a large variety of cytotoxic compounds. Frequently these cytotoxic compounds will have no common structural features nor will they interact with a common target within the cell. Resistance to these cytotoxic agents is mediated by an outward directed, ATP-dependent pump encoded by the mdr-1 gene. By this mechanism, toxic levels of a particular cytotoxic compound are not allowed to accumulate within the cell.

MDR-like genes have been identified in a number of divergent organisms including numerous bacterial species, the fruit fly Drosophila melanogaster, Plasmodium falciparum, the yeast Saccharomyces cerevisiae, Caenorhabditis elegans, Leishmania donovanii, marine sponges, the plant Arabidopsis thaliana, as well as Homo sapiens. Extensive searches have revealed several classes of compounds that are able to reverse the MDR phenotype of multiple drug resistant human cancer cell lines rendering them susceptible to the effects of cytotoxic compounds. These compounds, referred to herein as "MDR inhibitors", include for example, calcium channel blockers, anti-arrhythmics, antihypertensives, antibiotics, antihistamines, immuno-suppressants, steroid hormones, modified steroids, lipophilic cations, diterpenes, detergents, antidepressants, and antipsychotics (Gottesman and Pastan, 1993, Annual Review of Biochemistry 62:385-427). Clinical application of human MDR inhibitors to cancer chemotherapy has become an area of intensive focus for research.

On another front, the discovery and development of antifungal compounds for specific fungal species has also met with some degree of success. Candida species represent the majority of fungal infections, and screens for new antifungal compounds have been designed to discover anti-Candida compounds. During development of antifungal agents, activity has generally been optimized based on activity against Candida albicans. As a consequence, these anti-Candida compounds frequently do not possess clinically significant activity against other fungal species such as Aspergillus nidulans. However, it is interesting to note that at higher concentrations some anti-Candida compounds are able to kill other fungal species such as A. nidulans and A. fumigatus. This type of observation suggests that the antifungal target(s) of these anti-Candida compounds is present in A. nidulans as well. Such results indicate that A. nidulans may possess a natural mechanism of resistance that permits them to survive in clinically relevant concentrations of antifungal compounds. Until the present invention, such a general mechanism of resistance to antifungal compounds in A. nidulans has remained undescribed.

SUMMARY OF THE INVENTION

The invention provides, inter alia, isolated nucleic acid molecules that comprise nucleic acid encoding a multiple drug resistance protein from Aspergillus nidulans, herein referred to as atrD, vectors encoding atrD, and host cells transformed with these vectors.

In another embodiment, the invention provides a method for determining the fungal MDR inhibition activity of a compound which comprises:

(a) placing a culture of fungal cells, transformed with a vector capable of expressing atrD, in the presence of:

(i) an antifungal agent to which said fungal cell is resistant, but to which said fungal cell is sensitive in its untransformed state;

(ii) a compound suspected of possessing fungal MDR inhibition activity; and

(b) determining the fungal MDR inhibition activity of said compound by measuring the ability of the antifungal agent to inhibit the growth of said fungal cell.

In still another embodiment the present invention relates to strains of A. nidulans in which the atrD gene is disrupted or otherwise mutated such that the atrD protein is not produced in said strains.

In yet another embodiment, the present invention relates to a method for identifiying new antifungal compounds comprising the use of atrD⁻ gene disruption or gene replacement strains of A. nidulans.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides isolated nucleic acid molecules that comprise a nucleic acid sequence encoding atrD. The cDNA (complementary deoxyribonucleic acid) sequence encoding atrD is provided in the Sequence Listing as SEQ ID NO: 1. The amino acid sequence of the protein encoded by atrD is provided in the Sequence Listing as SEQ ID NO: 2.

Those skilled in the art will recognize that the degenerate nature of the genetic code enables one to construct many different nucleic acid sequences that encode the amino acid sequence of SEQ ID NO: 2. The cDNA sequence depicted by SEQ ID NO: 1 is only one of many possible atrD-encoding sequences. Consequently, the constructions described below and in the accompanying examples for the preferred nucleic acid molecules, vectors, and transformants of the invention are illustrative and are not intended to limit the scope of the invention.

All nucleotide and amino acid abbreviations used in this disclosure are those accepted by the United States Patent and Trademark Office as set forth in 37 C.F.R. §1.822(b)(1994).

The term "vector" refers to any autonomously replicating or integrating agent, including but not limited to plasmids, cosmids, and viruses (including phage), comprising a nucleic acid molecule to which one or more additional nucleic acid molecules can be added. Included in the definition of "vector" is the term "expression vector". Vectors are used either to amplify and/or to express deoxyribonucleic acid (DNA), either genomic or cDNA, or RNA (ribonucleic acid) which encodes atrD, or to amplify DNA or RNA that hybridizes with DNA or RNA encoding atrD.

The term "expression vector" refers to vectors which comprise a transcriptional promoter (hereinafter "promoter") and other regulatory sequences positioned to drive expression of a DNA segment that encodes atrD. Expression vectors of the present invention are replicable DNA constructs in which a DNA sequence encoding atrD is operably linked to suitable control sequences capable of effecting the expression of atrD in a suitable host. Such control sequences include a promoter, an optional operator sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding sites, and sequences which control termination of transcription and translation. DNA regions are operably linked when they are functionally related to each other. For example, a promoter is operably linked to a DNA coding sequence if it controls the transcription of the sequence, or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to permit translation.

The term "MDR inhibition activity" refers to the ability of a compound to inhibit the MDR activity of a host cell, thereby increasing the antifungal activity of an antifungal compound against said host cell.

In the present invention, atrD may be synthesized by host cells transformed with vectors that provide for the expression of DNA encoding atrD. The DNA encoding atrD may be the natural sequence or a synthetic sequence or a combination of both ("semi-synthetic sequence"). The in vitro or in vivo transcription and translation of these sequences results in the production of atrD. Synthetic and semi-synthetic sequences encoding atrD may be constructed by techniques well known in the art. See Brown et al. (1979) Methods in Enzymology, Academic Press, N.Y., 68:109-151. atrD-encoding DNA, or portions thereof, may be generated using a conventional DNA synthesizing apparatus such as the Applied Biosystems Model 380A,380B, 394 or 3948 DNA synthesizers (commercially available from Applied Biosystems, Inc., 850 Lincoln Center Drive, Foster City, Calif. 94404).

Owing to the natural degeneracy of the genetic code, the skilled artisan will recognize that a sizable yet definite number of nucleic acid sequences may be constructed which encode atrD. All such nucleic acid sequences are provided by the present invention. These sequences can be prepared by a variety of methods and, therefore, the invention is not limited to any particular preparation means. The nucleic acid sequences of the invention can be produced by a number of procedures, including DNA synthesis, cDNA cloning, genomic cloning, polymerase chain reaction (PCR) technology, or a combination of these approaches. These and other techniques are described by Maniatis, et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. (1989), or Current Protocols in Molecular Biology (F. M. Ausubel et al., 1989 and supplements). The contents of both of these references are incorporated herein by reference.

In another aspect, this invention provides the cDNA encoding atrD, which may be obtained by synthesizing the desired portion of SEQ ID NO:1 or by following the procedure carried out by Applicants. This procedure involved construction of a cosmid genomic DNA library from Aspergillus nidulans strain OC-1, a mutant derived from A42355. This library was screened for genes related to MDRs using a homologous probe generated by PCR. Degenerate PCR primers directed towards amplification of DNA sequences encoding highly conserved regions found in the ATP-binding domain of several MDR genes were synthesized. PCR using these primers and Aspergillus nidulans genomic DNA as template produced an approximately 400 base pair DNA fragment. The DNA sequence of this fragment was highly homologous to the ATP-binding region of several MDRs as predicted. This fragment was used as a hybridization probe to identify cosmid clones containing the entire atrD gene. A subclone from one such cosmid containing the entire atrD gene was sequenced to ascertain the entire sequence of atrD.

To effect the translation of atrD-encoding mRNA, one inserts the natural, synthetic, or semi-synthetic atrD-encoding DNA sequence into any of a large number of appropriate expression vectors through the use of appropriate restriction endonucleases and DNA ligases. Synthetic and semi-synthetic atrD-encoding DNA sequences can be designed, and natural atrD-encoding nucleic acid can be modified, to possess restriction endonuclease cleavage sites to facilitate isolation from and integration into these vectors. Particular restriction endonucleases employed will be dictated by the restriction endonuclease cleavage pattern of the expression vector utilized. Restriction enzyme sites are chosen so as to properly orient the atrD-encoding DNA with the control sequences to achieve proper in-frame transcription and translation of the atrD molecule. The atrD-encoding DNA must be positioned so as to be in proper reading frame with the promoter and ribosome binding site of the expression vector, both of which are functional in the host cell in which atrD is to be expressed.

Expression of atrD in fungal cells, such as Saccharomyces cerevisiae is preferred. Suitable promoter sequences for use with yeast hosts include the promoters for 3-phosphoglycerate kinase (found on plasmid pAP12BD (ATCC 53231) and described in U.S. Pat. No. 4,935,350, Jun. 19, 1990) or other glycolytic enzymes such as enolase (found on plasmid pAC1 (ATCC 39532)), glyceraldehyde-3-phosphate dehydrogenase (derived from plasmid pHcGAPC1 (ATCC 57090, 57091)), hexokinase, pyruvate decarboxylase, phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and glucokinase. Inducible yeast promoters have the additional advantage of transcription controlled by growth conditions. Such promoters include the promoter regions for alcohol dehydrogenase 2, isocytochrome C, acid phosphotase, degradative enzymes associated with nitrogen metabolism, metallothionein (contained on plasmid vector pCL28XhoLHBPV (ATCC 39475), U.S. Pat. No. 4,840,896), glyceraldehyde 3-phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization (GAL1 found on plasmid pRY121 (ATCC 37658) and on plasmid pPST5, described below). Suitable vectors and promoters for use in yeast expression are further described by R. Hitzeman et al., in European Patent Publication No. 73,657A. Yeast enhancers such as the UAS Gal enhancer from Saccharomyces cerevisiae (found in conjunction with the CYC1 promoter on plasmid YEpsec--hI1beta, ATCC 67024), also are advantageously used with yeast promoters.

A variety of expression vectors useful in the present invention are well known in the art. For expression in Saccharomyces, the plasmid YRp7, for example, (ATCC-40053, Stinchcomb et al., 1979, Nature 282:39; Kingsman et al., 1979, Gene 7:141; Tschemper et al., 1980, Gene 10:157) is commonly used. This plasmid contains the trp gene which provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example ATCC 44076 or PEP4-1 (Jones, 1977, Genetics 85:12).

Expression vectors useful in the expression of atrD can be constructed by a number of methods. For example, the cDNA sequence encoding atrD can be synthesized using DNA synthesis techniques such as those described above. Such synthetic DNA can be synthesized to contain cohesive ends that allow facile cloning into an appropriately digested expression vector. For example, the cDNA encoding atrD can be synthesized to contain NotI cohesive ends. Such a synthetic DNA fragment can be ligated into a NotI-digested expression vector such as pYES-2 (Invitrogen Corp., San Diego Calif. 92121).

An expression vector can also be constructed in the following manner. Logarithmic phase Aspergillus nidulans cells are disrupted by grinding under liquid nitrogen according to the procedure of Minuth et al., 1982 (Current Genetics 5:227-231). Aspergillus nidulans mRNA is preferably isolated from the disrupted cells using the QuickPrep® mRNA Purification Kit (Pharmacia Biotech) according to the instructions of the manufacturer. cDNA is produced from the isolated mRNA using the TimeSaver® cDNA Synthesis Kit (Pharmacia Biotech) using oligo (dT) according to the procedure described by the manufacturer. In this process an EcoRI/NotI adapter (Stratagene, Inc.) is ligated to each end of the double stranded cDNA. The adapter modified cDNA is ligated into the vector Lambda Zap^(R) II® using the Predigested Lambda Zap^(R) II®/EcoRI/CIAP Cloning Kit (Stratagene, Inc.) according to the instructions of the manufacturer to create a cDNA library.

The library is screened for full-length cDNA encoding atrD using a ³² P-radiolabeled fragment of the atrD gene. In this manner, a full-length cDNA clone is recovered from the Aspergillus nidulans cDNA library. A full-length cDNA clone recovered from the library is removed from the Lambda Zap^(R) II® vector by digestion with the restriction endonuclease NotI which produces a DNA fragment encoding atrD. The atrD encoding fragment is subcloned into plasmid pYES2 for expression studies. In this plasmid the atrD gene is operably linked to the Saccharomyces cerevisiae GAL1 promoter at the 5' end, and the yeast cycl transcription terminator at the 3' end. This plasmid further comprises the ColE1 origin of replication (ColE1) which allows replication in Escherichia coli host cells, and the ampicillin resistance gene (Amp) for selection of E. coli cells transformed with the plasmid grown in the presence of ampicillin. The expression plasmid further comprises the yeast 2μ origin of replication (2μ ori) allowing replication in yeast host cells, the yeast URA3 gene for selection of S. cerevisiae cells transformed with the plasmid grown in a medium lacking uracil, and the origin of replication from the f1 filamentous phage.

In a preferred embodiment of the invention Saccharomyces cerevisiae INVSc1 or INVSc2 cells (Invitrogen Corp., Sorrento Valley Blvd., San Diego Calif. 92121) are employed as host cells, but numerous other cell lines are available for this use. The transformed host cells are plated on an appropriate medium under selective pressure (minimal medium lacking uracil). The cultures are then incubated for a time and temperature appropriate to the host cell line employed.

The techniques involved in the transformation of yeast cells such as Saccharomyces cerevisiae cells are well known in the art and may be found in such general references as Ausubel et al., Current Protocols in Molecular Biology (1989), John Wiley & Sons, New York, N.Y. and supplements. The precise conditions under which the transformed yeast cells are cultured is dependent upon the nature of the yeast host cell line and the vectors employed.

Nucleic acid, either RNA or DNA, which encodes atrD, or a portion thereof, is also useful in producing nucleic acid molecules useful in diagnostic assays for the detection of atrD MRNA, atrD cDNA, or atrD genomic DNA. Further, nucleic acid, either RNA or DNA, which does not encode atrD, but which nonetheless is capable of hybridizing with atrD-encoding DNA or RNA is also useful in such diagnostic assays. These nucleic acid molecules may be covalently labeled by known methods with a detectable moiety such as a fluorescent group, a radioactive atom or a chemiluminescent group. The labeled nucleic acid is then used in conventional hybridization assays, such as Southern or Northern hybridization assays, or polymerase chain reaction assays (PCR), to identify hybridizing DNA, cDNA, or RNA molecules. PCR assays may also be performed using unlabeled nucleic acid molecules. Such assays may be employed to identify atrD vectors and transformants and in in vitro diagnosis to detect atrD-like MRNA, cDNA, or genomic DNA from other organisms.

U.S. patent application Ser. No. 08/111,680, the entire contents of which are hereby incorporated herein by reference, describes the use of combination therapy involving an antifungal agent possessing a proven spectrum of activity, with a fungal MDR inhibitor to treat fungal infections. This combination therapy approach enables an extension of the spectrum of antifungal activity for a given antifungal compound which previously had only demonstrated limited clinically relevant antifungal activity. Similarly, compounds with demonstrated antifungal activity can also be potentiated by a fungal MDR inhibitor such that the antifungal activity of these compounds is extended to previously resistant species. To identify compounds useful in such combination therapy the present invention provides an assay method for identifying compounds with Aspergillus nidulans MDR inhibition activity. Host cells that express atrD provide an excellent means for the identification of compounds useful as inhibitors of Aspergillus nidulans MDR activity. Generally, the assay utilizes a culture of a yeast cell transformed with a vector which provides expression of atrD. The expression of atrD by the host cell enables the host cell to grow in the presence of an antifungal compound to which the yeast cell is sensitive to in the untransformed state. Thus, the transformed yeast cell culture is grown in the presence of i) an antifungal agent to which the untransformed yeast cell is sensitive, but to which the transformed host cell is resistant, and ii) a compound that is suspected of being an MDR inhibitor. The effect of the suspected MDR inhibitor is measured by testing for the ability of the antifungal compound to inhibit the growth of the transformed yeast cell. Such inhibition will occur if the suspected Aspergillus nidulans MDR inhibitor blocks the ability of atrD to prevent the antifungal compound from acting on the yeast cell. An illustrative example of such an assay is provided in Example 3.

In order to illustrate more fully the operation of this invention, the following examples are provided, but are not to be construed as a limitation on the scope of the invention.

EXAMPLE 1 Source of the atrD-Encoding Genomic DNA and cDNA of Aspergillus nidulans

Genomic DNA encoding atrD, or the corresponding cDNA sequence (presented in SEQ ID NO:1), may be from a natural sequence, a synthetic source or a combination of both ("semi-synthetic sequence"). The in vitro or in vivo transcription and translation of these sequences results in the production of atrD. Synthetic and semi-synthetic sequences encoding atrD may be constructed by techniques well known in the art. See Brown et al. (1979) Methods in Enzymology, Academic Press, N.Y., 68:109-151. atrD-encoding DNA, or portions thereof, may be generated using a conventional DNA synthesizing apparatus such as the Applied Biosystems Model 380A, 380B, 384 or 3848 DNA synthesizers (commercially available from Applied Biosystems, Inc., 850 Lincoln Center Drive, Foster City, Calif. 94404). The polymerase chain reaction is especially useful in generating these DNA sequences. PCR primers are constructed which include the translational start (ATG) and translational stop codon (TAG) of atrD. Restriction enzyme sites may be included on these PCR primers outside of the atrD coding region to facilitate rapid cloning into expression vectors. Aspergillus nidulans genomic DNA is used as the PCR template for synthesis of atrD including introns which is useful for expression studies in closely related fungi. In contrast, cDNA is used as the PCR template for synthesis of atrD devoid of introns which is useful for expression in foreign hosts such as Saccharomyces cerevisiae or bacterial hosts such as Escherichia coli.

EXAMPLE 2 Expression of the atrD Protein

Saccharomyces cerevisiae INVSc1 cells (Invitrogen Corp., San Diego Calif. 92191) are transformed with the plasmid containing atrD by the technique described by J. D. Beggs, 1988, Nature 275:104-109). The transformed yeast cells are grown in a broth medium containing YNB/CSM-Ura/raf (YNB/CSM-Ura Yeast Nitrogen Base (Difco Laboratories, Detroit, Mich.) supplemented with CSM-URA (Bio 101, Inc.)! supplemented with 4% raffinose) at 28° C. in a shaker incubator until the culture is saturated. To induce expression of atrD, a portion of the culture is used to inoculate a flask containing YNB/CSM-Ura medium supplemented with 2% galactose (YNB/CSM-Ura/gal) rather than raffinose as the sole carbon source. The inoculated flask is incubated at 28° C. for about 16 hours.

EXAMPLE 3 Antifungal Potentiator Assay

Approximately 1×10⁶ cells of a Saccharomyces cerevisiae INVSc1 culture expressing atrD are delivered to each of several agar plates containing YNB/CSM-Ura/gal. The agar surface is allowed to dry in a biohazard hood.

An antifungal compound that the untransformed yeast cell is typically sensitive to is dissolved in an appropriate solvent at a concentration that is biologically effective. Twenty μl of the solution is delivered to an antibiotic susceptibility test disc (Difco Laboratories, Detroit, Mich.). After addition of the antifungal solution the disc is allowed to air dry in a biohazard hood. When dry, the disc is placed on the surface of the petri plates containing the transformed Saccharomyces cerevisiae INVSc1 cells.

Compounds to be tested for the ability to inhibit atrD are dissolved in dimethylsulfoxide (DMSO). The amount of compound added to the DMSO depends on the solubility of the individual compound to be tested. Twenty ml of the suspensions containing a compound to be tested are delivered to an antibiotic susceptibility test disc (Difco Laboratories, Detroit, Mich.). The disc is then placed on the surface of the dried petri plates containing the transformed Saccharomyces cerevisiae INVSc1 cells approximately 2 cm from the antifungal-containing disc. Petri plates containing the two discs are incubated at 28° C. for about 16-48 hours.

Following this incubation period, the petri plates are examined for zones of growth inhibition around the discs. A zone of growth inhibition near the antifungal disc on the test plate indicates that the compound being tested for MDR inhibition activity blocks the activity of atrD and allows the antifungal compound to inhibit the growth of the yeast host cell. Such compounds are said to possess MDR inhibition activity. Little or no zone of growth inhibition indicates that the test compound does not block MDR activity and, thus, atrD is allowed to act upon the antifungal compound to prevent its activity upon the host cell.

EXAMPLE 4 Screen For Novel Antifungal Compounds

A plasmid molecule is constructed which contains DNA sequence information required for replication and genetic transformation in E. coli (e.g. ampicillin resistance). The plasmid also comprises DNA sequences encoding a marker for selection in fungal cells (e.g. hygromycin B phosphotransferase, phleomycin resistance, G418 resistance) under the control of an A. nidulans promoter. Additionally, the plasmid contains an internal portion of the atrD gene (e.g. about 3000 base pairs which lack 500 base pairs at the N-terminal end, and about 500 base pairs at the C-terminal end of the coding region specified by SEQ ID NO:1). The atrD gene fragment enables a single crossover gene disruption when transformed or otherwise introduced into A. nidulans.

Alternatively, a 5 kilobase pair to 6 kilobase pair region of A. nidulans genomic DNA containing the atrD gene is subcloned into the aforementioned plasmid. Then, a central portion of the atrD gene is removed and replaced with a selectable marker, such as hyromycin B phosphotransferase, for a double crossover gene replacement.

Gene disruption and gene replacement procedures for A. nidulans are well known in the art (See e.g. May et al, J. Cell Biol. 101, 712, 1985; Jones and Sealy-Lewis, Curr. Genet. 17, 81, 1990). Transformants are recovered on an appropriate selection medium, for example, hygromycin (if hygromycin B gene is used in the construction of disruption cassette). Gene replacement, or gene disruption, is verified by any suitable method, for example, by Southern blot hybridization.

Gene disruption or gene replacement strains are rendered hypersensitive to antifungal compounds, and are useful in screens for new antifungal compounds in whole cell growth inhibition studies.

    __________________________________________________________________________     #             SEQUENCE LISTING     - (1) GENERAL INFORMATION:     -    (iii) NUMBER OF SEQUENCES: 3     - (2) INFORMATION FOR SEQ ID NO:1:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 4002 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: cDNA     -    (iii) HYPOTHETICAL: NO     -     (iv) ANTI-SENSE: NO     -     (ix) FEATURE:               (A) NAME/KEY: CDS               (B) LOCATION: 1..4002     #ID NO:1: (xi) SEQUENCE DESCRIPTION: SEQ     - ATG TCC CCG CTA GAG ACA AAT CCC CTT TCG CC - #A GAG ACT GCT ATG CGC       48     Met Ser Pro Leu Glu Thr Asn Pro Leu Ser Pr - #o Glu Thr Ala Met Arg     #                 15     - GAA CCT GCT GAG ACT TCA ACG ACG GAG GAG CA - #A GCT TCT ACA CCA CAC       96     Glu Pro Ala Glu Thr Ser Thr Thr Glu Glu Gl - #n Ala Ser Thr Pro His     #             30     - GCT GCG GAC GAG AAG AAA ATC CTC AGC GAC CT - #C TCG GCT CCA TCT AGT      144     Ala Ala Asp Glu Lys Lys Ile Leu Ser Asp Le - #u Ser Ala Pro Ser Ser     #         45     - ACT ACA GCA ACC CCC GCA GAC AAG GAG CAC CG - #T CCT AAA TCG TCG TCC      192     Thr Thr Ala Thr Pro Ala Asp Lys Glu His Ar - #g Pro Lys Ser Ser Ser     #     60     - AGC AAT AAT GCG GTC TCG GTC AAC GAA GTC GA - #T GCG CTT ATT GCG CAC      240     Ser Asn Asn Ala Val Ser Val Asn Glu Val As - #p Ala Leu Ile Ala His     # 80     - CTG CCA GAA GAC GAG AGG CAG GTC TTG AAG AC - #G CAG CTG GAG GAG ATC      288     Leu Pro Glu Asp Glu Arg Gln Val Leu Lys Th - #r Gln Leu Glu Glu Ile     #                 95     - AAA GTA AAC ATC TCC TTC TTC GGT CTC TGG CG - #G TAT GCA ACA AAG ATG      336     Lys Val Asn Ile Ser Phe Phe Gly Leu Trp Ar - #g Tyr Ala Thr Lys Met     #           110     - GAT ATA CTT ATC ATG GTA ATC AGT ACA ATC TG - #T GCC ATT GCT GCC GCG      384     Asp Ile Leu Ile Met Val Ile Ser Thr Ile Cy - #s Ala Ile Ala Ala Ala     #       125     - TCG ACT TTC CAG AGG ATA ATG TTA TAT CAA AT - #C TCG TAC GAC GAG TTC      432     Ser Thr Phe Gln Arg Ile Met Leu Tyr Gln Il - #e Ser Tyr Asp Glu Phe     #   140     - TAT GAT GAA TTG ACC AAG AAC GTA CTG TAC TT - #C GTA TAC CTC GGT ATC      480     Tyr Asp Glu Leu Thr Lys Asn Val Leu Tyr Ph - #e Val Tyr Leu Gly Ile     145                 1 - #50                 1 - #55                 1 -     #60     - GGC GAG TTT GTC ACT GTC TAT GTT AGT ACT GT - #T GGC TTC ATC TAT ACC      528     Gly Glu Phe Val Thr Val Tyr Val Ser Thr Va - #l Gly Phe Ile Tyr Thr     #               175     - GGA GAA CAC GCC ACG CAG AAG ATC CGC GAG TA - #T TAC CTT GAG TCT ATC      576     Gly Glu His Ala Thr Gln Lys Ile Arg Glu Ty - #r Tyr Leu Glu Ser Ile     #           190     - CTG CGC CAG AAC ATT GGC TAT TTT GAT AAA CT - #C GGT GCC GGG GAA GTG      624     Leu Arg Gln Asn Ile Gly Tyr Phe Asp Lys Le - #u Gly Ala Gly Glu Val     #       205     - ACC ACC CGT ATA ACA GCC GAT ACA AAC CTT AT - #C CAG GAT GGC ATT TCG      672     Thr Thr Arg Ile Thr Ala Asp Thr Asn Leu Il - #e Gln Asp Gly Ile Ser     #   220     - GAG AAG GTC GGT CTC ACT TTG ACT GCC CTG GC - #G ACA TTC GTG ACA GCA      720     Glu Lys Val Gly Leu Thr Leu Thr Ala Leu Al - #a Thr Phe Val Thr Ala     225                 2 - #30                 2 - #35                 2 -     #40     - TTC ATT ATC GCC TAC GTC AAA TAC TGG AAG TT - #G GCT CTA ATT TGC AGC      768     Phe Ile Ile Ala Tyr Val Lys Tyr Trp Lys Le - #u Ala Leu Ile Cys Ser     #               255     - TCA ACA ATT GTG GCC CTC GTT CTC ACC ATG GG - #C GGT GGT TCT CAG TTT      816     Ser Thr Ile Val Ala Leu Val Leu Thr Met Gl - #y Gly Gly Ser Gln Phe     #           270     - ATC ATC AAG TAC AGC AAA AAG TCG CTT GAC AG - #C TAC GGT GCA GGC GGC      864     Ile Ile Lys Tyr Ser Lys Lys Ser Leu Asp Se - #r Tyr Gly Ala Gly Gly     #       285     - ACT GTT GCG GAA GAG GTC ATC AGC TCC ATC AG - #A AAT GCC ACA GCG TTT      912     Thr Val Ala Glu Glu Val Ile Ser Ser Ile Ar - #g Asn Ala Thr Ala Phe     #   300     - GGC ACC CAA GAC AAG CTT GCG AAG CAG TAT GA - #G GTC CAC TTA GAC GAA      960     Gly Thr Gln Asp Lys Leu Ala Lys Gln Tyr Gl - #u Val His Leu Asp Glu     305                 3 - #10                 3 - #15                 3 -     #20     - GCT GAG AAA TGG GGA ACA AAG AAC CAG ATT GT - #C ATG GGT TTC ATG ATT     1008     Ala Glu Lys Trp Gly Thr Lys Asn Gln Ile Va - #l Met Gly Phe Met Ile     #               335     - GGC GCC ATG TTT GGC CTT ATG TAC TCG AAC TA - #C GGT CTT GGC TTC TGG     1056     Gly Ala Met Phe Gly Leu Met Tyr Ser Asn Ty - #r Gly Leu Gly Phe Trp     #           350     - ATG GGT TCT CGT TTC CTG GTA GAT GGT GCA GT - #C GAT GTG GGT GAT ATT     1104     Met Gly Ser Arg Phe Leu Val Asp Gly Ala Va - #l Asp Val Gly Asp Ile     #       365     - CTC ACA GTT CTC ATG GCC ATC TTG ATC GGA TC - #G TTC TCC TTG GGG AAC     1152     Leu Thr Val Leu Met Ala Ile Leu Ile Gly Se - #r Phe Ser Leu Gly Asn     #   380     - GTT AGT CCA AAT GCT CAA GCA TTT ACA AAC GC - #T GTG GCC GCG GCC GCA     1200     Val Ser Pro Asn Ala Gln Ala Phe Thr Asn Al - #a Val Ala Ala Ala Ala     385                 3 - #90                 3 - #95                 4 -     #00     - AAG ATA TTT GGA ACG ATC GAT CGC CAG TCC CC - #A TTA GAT CCA TAT TCG     1248     Lys Ile Phe Gly Thr Ile Asp Arg Gln Ser Pr - #o Leu Asp Pro Tyr Ser     #               415     - AAC GAA GGG AAG ACG CTC GAC CAT TTT GAG GG - #C CAC ATT GAG TTA CGC     1296     Asn Glu Gly Lys Thr Leu Asp His Phe Glu Gl - #y His Ile Glu Leu Arg     #           430     - AAT GTC AAG CAT ATT TAC CCA TCT AGA CCC GA - #G GTC ACC GTC ATG GAG     1344     Asn Val Lys His Ile Tyr Pro Ser Arg Pro Gl - #u Val Thr Val Met Glu     #       445     - GAT GTT TCT CTG TCA ATG CCC GCT GGA AAA AC - #A ACC GCT TTA GTC GGC     1392     Asp Val Ser Leu Ser Met Pro Ala Gly Lys Th - #r Thr Ala Leu Val Gly     #   460     - CCC TCT GGC TCT GGA AAA AGT ACG GTG GTC GG - #C TTG GTT GAG CGA TTC     1440     Pro Ser Gly Ser Gly Lys Ser Thr Val Val Gl - #y Leu Val Glu Arg Phe     465                 4 - #70                 4 - #75                 4 -     #80     - TAC ATG CCT GTT CGC GGT ACG GTT TTG CTG GA - #T GGC CAT GAC ATC AAG     1488     Tyr Met Pro Val Arg Gly Thr Val Leu Leu As - #p Gly His Asp Ile Lys     #               495     - GAC CTC AAT CTC CGC TGG CTT CGC CAA CAG AT - #C TCT TTG GTT AGC CAG     1536     Asp Leu Asn Leu Arg Trp Leu Arg Gln Gln Il - #e Ser Leu Val Ser Gln     #           510     - GAG CCT GTT CTT TTT GGC ACG ACG ATT TAT AA - #G AAT ATT AGG CAC GGT     1584     Glu Pro Val Leu Phe Gly Thr Thr Ile Tyr Ly - #s Asn Ile Arg His Gly     #       525     - CTC ATC GGC ACA AAG TAC GAG AAT GAA TCC GA - #G GAT AAG GTC CGG GAA     1632     Leu Ile Gly Thr Lys Tyr Glu Asn Glu Ser Gl - #u Asp Lys Val Arg Glu     #   540     - CTC ATC GAG AAC GCG GCA AAA ATG GCG AAT GC - #T CAT GAC TTT ATT ACT     1680     Leu Ile Glu Asn Ala Ala Lys Met Ala Asn Al - #a His Asp Phe Ile Thr     545                 5 - #50                 5 - #55                 5 -     #60     - GCC TTG CCT GAA GGT TAT GAG ACC AAT GTT GG - #G CAG CGT GGC TTT CTC     1728     Ala Leu Pro Glu Gly Tyr Glu Thr Asn Val Gl - #y Gln Arg Gly Phe Leu     #               575     - CTT TCA GGT GGC CAG AAA CAG CGC ATT GCA AT - #C GCC CGT GCC GTT GTT     1776     Leu Ser Gly Gly Gln Lys Gln Arg Ile Ala Il - #e Ala Arg Ala Val Val     #           590     - AGT GAC CCA AAA ATC CTG CTC CTG GAT GAA GC - #T ACT TCG GCC TTG GAC     1824     Ser Asp Pro Lys Ile Leu Leu Leu Asp Glu Al - #a Thr Ser Ala Leu Asp     #       605     - ACA AAA TCC GAA GGC GTG GTT CAA GCA GCT TT - #G GAG AGG GCA GCT GAA     1872     Thr Lys Ser Glu Gly Val Val Gln Ala Ala Le - #u Glu Arg Ala Ala Glu     #   620     - GGC CGA ACT ACT ATT GTG ATC GCT CAT CGC CT - #T TCC ACG ATC AAA ACG     1920     Gly Arg Thr Thr Ile Val Ile Ala His Arg Le - #u Ser Thr Ile Lys Thr     625                 6 - #30                 6 - #35                 6 -     #40     - GCG CAC AAC ATT GTG GTT CTG GTC AAT GGC AA - #A ATT GCT GAA CAA GGA     1968     Ala His Asn Ile Val Val Leu Val Asn Gly Ly - #s Ile Ala Glu Gln Gly     #               655     - ACT CAC GAT GAA TTG GTT GAC CGC GGA GGC GC - #T TAT CGC AAA CTT GTG     2016     Thr His Asp Glu Leu Val Asp Arg Gly Gly Al - #a Tyr Arg Lys Leu Val     #           670     - GAG GCT CAA CGT ATC AAT GAA CAG AAG GAA GC - #T GAC GCC TTG GAG GAC     2064     Glu Ala Gln Arg Ile Asn Glu Gln Lys Glu Al - #a Asp Ala Leu Glu Asp     #       685     - GCC GAC GCT GAG GAT CTC ACG AAT GCA GAT AT - #T GCC AAA ATC AAA ACT     2112     Ala Asp Ala Glu Asp Leu Thr Asn Ala Asp Il - #e Ala Lys Ile Lys Thr     #   700     - GCG TCA AGC GCA TCA TCC GAT CTC GAC GGA AA - #A CCC ACA ACC ATT GAC     2160     Ala Ser Ser Ala Ser Ser Asp Leu Asp Gly Ly - #s Pro Thr Thr Ile Asp     705                 7 - #10                 7 - #15                 7 -     #20     - CGC ACG GGC ACC CAC AAG TCT GTT TCC AGC GC - #G ATT CTT TCT AAA AGA     2208     Arg Thr Gly Thr His Lys Ser Val Ser Ser Al - #a Ile Leu Ser Lys Arg     #               735     - CCC CCC GAA ACA ACT CCG AAA TAC TCA TTA TG - #G ACG CTG CTC AAA TTT     2256     Pro Pro Glu Thr Thr Pro Lys Tyr Ser Leu Tr - #p Thr Leu Leu Lys Phe     #           750     - GTT GCT TCC TTC AAC CGC CCT GAA ATC CCG TA - #C ATG CTC ATC GGT CTT     2304     Val Ala Ser Phe Asn Arg Pro Glu Ile Pro Ty - #r Met Leu Ile Gly Leu     #       765     - GTC TTC TCA GTG TTA GCT GGT GGT GGC CAA CC - #C ACG CAA GCA GTG CTA     2352     Val Phe Ser Val Leu Ala Gly Gly Gly Gln Pr - #o Thr Gln Ala Val Leu     #   780     - TAT GCT AAA GCC ATC AGC ACA CTC TCG CTC CC - #A GAA TCA CAA TAT AGC     2400     Tyr Ala Lys Ala Ile Ser Thr Leu Ser Leu Pr - #o Glu Ser Gln Tyr Ser     785                 7 - #90                 7 - #95                 8 -     #00     - AAG CTT CGA CAT GAT GCG GAT TTC TGG TCA TT - #G ATG TTC TTC GTG GTT     2448     Lys Leu Arg His Asp Ala Asp Phe Trp Ser Le - #u Met Phe Phe Val Val     #               815     - GGT ATC ATT CAG TTT ATC ACG CAG TCA ACC AA - #T GGT GCT GCA TTT GCC     2496     Gly Ile Ile Gln Phe Ile Thr Gln Ser Thr As - #n Gly Ala Ala Phe Ala     #           830     - GTA TGC TCC GAG AGA CTT ATT CGT CGC GCG AG - #A AGC ACT GCC TTT CGG     2544     Val Cys Ser Glu Arg Leu Ile Arg Arg Ala Ar - #g Ser Thr Ala Phe Arg     #       845     - ACG ATA CTC CGT CAA GAC ATT GCT TTC TTT GA - #C AAG GAA GAG AAT AGC     2592     Thr Ile Leu Arg Gln Asp Ile Ala Phe Phe As - #p Lys Glu Glu Asn Ser     #   860     - ACC GGC GCT CTG ACC TCT TTC CTG TCC ACC GA - #G ACG AAG CAT CTC TCC     2640     Thr Gly Ala Leu Thr Ser Phe Leu Ser Thr Gl - #u Thr Lys His Leu Ser     865                 8 - #70                 8 - #75                 8 -     #80     - GGT GTT AGC GGT GTG ACT CTA GGC ACG ATC TT - #G ATG ACC TCC ACG ACC     2688     Gly Val Ser Gly Val Thr Leu Gly Thr Ile Le - #u Met Thr Ser Thr Thr     #               895     - CTA GGA GCG GCT ATC ATT ATT GCC CTG GCG AT - #T GGG TGG AAA TTG GCC     2736     Leu Gly Ala Ala Ile Ile Ile Ala Leu Ala Il - #e Gly Trp Lys Leu Ala     #           910     - TTA GTT TGT ATC TCG GTT GTG CCG GTT CTC CT - #G GCA TGC GGT TTC TAC     2784     Leu Val Cys Ile Ser Val Val Pro Val Leu Le - #u Ala Cys Gly Phe Tyr     #       925     - CGA TTC TAT ATG CTA GCC CAG TTT CAA TCA CG - #C TCC AAG CTT GCT TAT     2832     Arg Phe Tyr Met Leu Ala Gln Phe Gln Ser Ar - #g Ser Lys Leu Ala Tyr     #   940     - GAG GGA TCT GCA AAC TTT GCT TGC GAG GCT AC - #A TCG TCT ATC CGC ACA     2880     Glu Gly Ser Ala Asn Phe Ala Cys Glu Ala Th - #r Ser Ser Ile Arg Thr     945                 9 - #50                 9 - #55                 9 -     #60     - GTT GCG TCA TTA ACC CGG GAA AGG GAT GTC TG - #G GAG ATT TAC CAT GCC     2928     Val Ala Ser Leu Thr Arg Glu Arg Asp Val Tr - #p Glu Ile Tyr His Ala     #               975     - CAG CTT GAC GCA CAA GGC AGG ACC AGT CTA AT - #C TCT GTC TTG AGG TCA     2976     Gln Leu Asp Ala Gln Gly Arg Thr Ser Leu Il - #e Ser Val Leu Arg Ser     #           990     - TCC CTG TTA TAT GCG TCG TCG CAG GCA CTT GT - #T TTC TTC TGC GTT GCG     3024     Ser Leu Leu Tyr Ala Ser Ser Gln Ala Leu Va - #l Phe Phe Cys Val Ala     #      10050     - CTC GGG TTT TGG TAC GGA GGG ACA CTT CTT GG - #T CAC CAC GAG TAT GAC     3072     Leu Gly Phe Trp Tyr Gly Gly Thr Leu Leu Gl - #y His His Glu Tyr Asp     #  10205     - ATT TTC CGC TTC TTT GTT TGT TTC TCC GAG AT - #T CTC TTT GGT GCT CAA     3120     Ile Phe Arg Phe Phe Val Cys Phe Ser Glu Il - #e Leu Phe Gly Ala Gln     #               10401030 - #                1035     - TCC GCG GGC ACC GTC TTT TCC TTT GCA CCA GA - #C ATG GGC AAG GCG AAG     3168     Ser Ala Gly Thr Val Phe Ser Phe Ala Pro As - #p Met Gly Lys Ala Lys     #              10550     - AAT GCG GCC GCC GAA TTC CGA CGA CTG TTC GA - #C CGA AAG CCA CAA ATT     3216     Asn Ala Ala Ala Glu Phe Arg Arg Leu Phe As - #p Arg Lys Pro Gln Ile     #          10705     - GAT AAC TGG TCT GAA GAG GGC GAG AAG CTC GA - #A ACG GTG GAA GGT GAA     3264     Asp Asn Trp Ser Glu Glu Gly Glu Lys Leu Gl - #u Thr Val Glu Gly Glu     #      10850     - ATC GAA TTT AGG AAC GTG CAC TTC AGA TAC CC - #G ACC CGC CCA GAA CAG     3312     Ile Glu Phe Arg Asn Val His Phe Arg Tyr Pr - #o Thr Arg Pro Glu Gln     #  11005     - CCT GTC CTG CGC GGC TTG GAC CTG ACC GTG AA - #G CCT GGA CAA TAT GTT     3360     Pro Val Leu Arg Gly Leu Asp Leu Thr Val Ly - #s Pro Gly Gln Tyr Val     #               11201110 - #                1115     - GCG CTT GTC GGA CCC AGC GGT TGT GGC AAG AG - #T ACC ACC ATT GCA TTG     3408     Ala Leu Val Gly Pro Ser Gly Cys Gly Lys Se - #r Thr Thr Ile Ala Leu     #              11350     - CTT GAG CGC TTT TAC GAT GCG ATT GCC GGG TC - #C ATC CTT GTT GAT GGG     3456     Leu Glu Arg Phe Tyr Asp Ala Ile Ala Gly Se - #r Ile Leu Val Asp Gly     #          11505     - AAG GAC ATA AGT AAA CTA AAT ATC AAC TCC TA - #C CGC AGC TTT CTG TCA     3504     Lys Asp Ile Ser Lys Leu Asn Ile Asn Ser Ty - #r Arg Ser Phe Leu Ser     #      11650     - CTG GTC AGC CAG GAG CCG ACA CTG TAC CAG GG - #C ACC ATC AAG GAA AAC     3552     Leu Val Ser Gln Glu Pro Thr Leu Tyr Gln Gl - #y Thr Ile Lys Glu Asn     #  11805     - ATC TTA CTT GGT ATT GTC GAA GAT GAC GTA CC - #G GAA GAA TTC TTG ATT     3600     Ile Leu Leu Gly Ile Val Glu Asp Asp Val Pr - #o Glu Glu Phe Leu Ile     #               12001190 - #                1195     - AAG GCT TGC AAG GAC GCT AAT ATC TAC GAC TT - #C ATC ATG TCG CTC CCG     3648     Lys Ala Cys Lys Asp Ala Asn Ile Tyr Asp Ph - #e Ile Met Ser Leu Pro     #              12150     - GAG GGC TTT AAT ACA GTT GTT GGC AGC AAG GG - #A GGC ATG TTG TCT GGC     3696     Glu Gly Phe Asn Thr Val Val Gly Ser Lys Gl - #y Gly Met Leu Ser Gly     #          12305     - GGC CAA AAG CAA CGT GTG GCC ATT GCC CGA GC - #C CTT CTT CGG GAT CCC     3744     Gly Gln Lys Gln Arg Val Ala Ile Ala Arg Al - #a Leu Leu Arg Asp Pro     #      12450     - AAA ATC CTT CTT CTC GAT GAA GCG ACG TCA GC - #C CTC GAC TCC GAG TCA     3792     Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Al - #a Leu Asp Ser Glu Ser     #  12605     - GAA AAG GTC GTC CAG GCG GCT TTG GAT GCC GC - #T GCC CGA GGC CGA ACC     3840     Glu Lys Val Val Gln Ala Ala Leu Asp Ala Al - #a Ala Arg Gly Arg Thr     #               12801270 - #                1275     - ACA ATC GCC GTT GCA CAC CGA CTC AGC ACG AT - #T CAA AAG GCG GAC GTT     3888     Thr Ile Ala Val Ala His Arg Leu Ser Thr Il - #e Gln Lys Ala Asp Val     #              12950     - ATC TAT GTT TTC GAC CAA GGC AAG ATC GTC GA - #A AGC GGA ACG CAC AGC     3936     Ile Tyr Val Phe Asp Gln Gly Lys Ile Val Gl - #u Ser Gly Thr His Ser     #          13105     - GAA CTG GTC CAG AAA AAG GGC CGG TAC TAC GA - #G CTG GTC AAC TTG CAG     3984     Glu Leu Val Gln Lys Lys Gly Arg Tyr Tyr Gl - #u Leu Val Asn Leu Gln     #      13250     #4002              GC CAT     Ser Leu Gly Lys Gly His         1330     - (2) INFORMATION FOR SEQ ID NO:2:     -      (i) SEQUENCE CHARACTERISTICS:     #acids    (A) LENGTH: 1334 amino               (B) TYPE: amino acid               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: protein     -           (xi) SEQUENCE DESCRIPTION: - # SEQ ID NO:2:     - Met Ser Pro Leu Glu Thr Asn Pro Leu Ser Pr - #o Glu Thr Ala Met Arg     #                 15     - Glu Pro Ala Glu Thr Ser Thr Thr Glu Glu Gl - #n Ala Ser Thr Pro His     #             30     - Ala Ala Asp Glu Lys Lys Ile Leu Ser Asp Le - #u Ser Ala Pro Ser Ser     #         45     - Thr Thr Ala Thr Pro Ala Asp Lys Glu His Ar - #g Pro Lys Ser Ser Ser     #     60     - Ser Asn Asn Ala Val Ser Val Asn Glu Val As - #p Ala Leu Ile Ala His     # 80     - Leu Pro Glu Asp Glu Arg Gln Val Leu Lys Th - #r Gln Leu Glu Glu Ile     #                 95     - Lys Val Asn Ile Ser Phe Phe Gly Leu Trp Ar - #g Tyr Ala Thr Lys Met     #           110     - Asp Ile Leu Ile Met Val Ile Ser Thr Ile Cy - #s Ala Ile Ala Ala Ala     #       125     - Ser Thr Phe Gln Arg Ile Met Leu Tyr Gln Il - #e Ser Tyr Asp Glu Phe     #   140     - Tyr Asp Glu Leu Thr Lys Asn Val Leu Tyr Ph - #e Val Tyr Leu Gly Ile     145                 1 - #50                 1 - #55                 1 -     #60     - Gly Glu Phe Val Thr Val Tyr Val Ser Thr Va - #l Gly Phe Ile Tyr Thr     #               175     - Gly Glu His Ala Thr Gln Lys Ile Arg Glu Ty - #r Tyr Leu Glu Ser Ile     #           190     - Leu Arg Gln Asn Ile Gly Tyr Phe Asp Lys Le - #u Gly Ala Gly Glu Val     #       205     - Thr Thr Arg Ile Thr Ala Asp Thr Asn Leu Il - #e Gln Asp Gly Ile Ser     #   220     - Glu Lys Val Gly Leu Thr Leu Thr Ala Leu Al - #a Thr Phe Val Thr Ala     225                 2 - #30                 2 - #35                 2 -     #40     - Phe Ile Ile Ala Tyr Val Lys Tyr Trp Lys Le - #u Ala Leu Ile Cys Ser     #               255     - Ser Thr Ile Val Ala Leu Val Leu Thr Met Gl - #y Gly Gly Ser Gln Phe     #           270     - Ile Ile Lys Tyr Ser Lys Lys Ser Leu Asp Se - #r Tyr Gly Ala Gly Gly     #       285     - Thr Val Ala Glu Glu Val Ile Ser Ser Ile Ar - #g Asn Ala Thr Ala Phe     #   300     - Gly Thr Gln Asp Lys Leu Ala Lys Gln Tyr Gl - #u Val His Leu Asp Glu     305                 3 - #10                 3 - #15                 3 -     #20     - Ala Glu Lys Trp Gly Thr Lys Asn Gln Ile Va - #l Met Gly Phe Met Ile     #               335     - Gly Ala Met Phe Gly Leu Met Tyr Ser Asn Ty - #r Gly Leu Gly Phe Trp     #           350     - Met Gly Ser Arg Phe Leu Val Asp Gly Ala Va - #l Asp Val Gly Asp Ile     #       365     - Leu Thr Val Leu Met Ala Ile Leu Ile Gly Se - #r Phe Ser Leu Gly Asn     #   380     - Val Ser Pro Asn Ala Gln Ala Phe Thr Asn Al - #a Val Ala Ala Ala Ala     385                 3 - #90                 3 - #95                 4 -     #00     - Lys Ile Phe Gly Thr Ile Asp Arg Gln Ser Pr - #o Leu Asp Pro Tyr Ser     #               415     - Asn Glu Gly Lys Thr Leu Asp His Phe Glu Gl - #y His Ile Glu Leu Arg     #           430     - Asn Val Lys His Ile Tyr Pro Ser Arg Pro Gl - #u Val Thr Val Met Glu     #       445     - Asp Val Ser Leu Ser Met Pro Ala Gly Lys Th - #r Thr Ala Leu Val Gly     #   460     - Pro Ser Gly Ser Gly Lys Ser Thr Val Val Gl - #y Leu Val Glu Arg Phe     465                 4 - #70                 4 - #75                 4 -     #80     - Tyr Met Pro Val Arg Gly Thr Val Leu Leu As - #p Gly His Asp Ile Lys     #               495     - Asp Leu Asn Leu Arg Trp Leu Arg Gln Gln Il - #e Ser Leu Val Ser Gln     #           510     - Glu Pro Val Leu Phe Gly Thr Thr Ile Tyr Ly - #s Asn Ile Arg His Gly     #       525     - Leu Ile Gly Thr Lys Tyr Glu Asn Glu Ser Gl - #u Asp Lys Val Arg Glu     #   540     - Leu Ile Glu Asn Ala Ala Lys Met Ala Asn Al - #a His Asp Phe Ile Thr     545                 5 - #50                 5 - #55                 5 -     #60     - Ala Leu Pro Glu Gly Tyr Glu Thr Asn Val Gl - #y Gln Arg Gly Phe Leu     #               575     - Leu Ser Gly Gly Gln Lys Gln Arg Ile Ala Il - #e Ala Arg Ala Val Val     #           590     - Ser Asp Pro Lys Ile Leu Leu Leu Asp Glu Al - #a Thr Ser Ala Leu Asp     #       605     - Thr Lys Ser Glu Gly Val Val Gln Ala Ala Le - #u Glu Arg Ala Ala Glu     #   620     - Gly Arg Thr Thr Ile Val Ile Ala His Arg Le - #u Ser Thr Ile Lys Thr     625                 6 - #30                 6 - #35                 6 -     #40     - Ala His Asn Ile Val Val Leu Val Asn Gly Ly - #s Ile Ala Glu Gln Gly     #               655     - Thr His Asp Glu Leu Val Asp Arg Gly Gly Al - #a Tyr Arg Lys Leu Val     #           670     - Glu Ala Gln Arg Ile Asn Glu Gln Lys Glu Al - #a Asp Ala Leu Glu Asp     #       685     - Ala Asp Ala Glu Asp Leu Thr Asn Ala Asp Il - #e Ala Lys Ile Lys Thr     #   700     - Ala Ser Ser Ala Ser Ser Asp Leu Asp Gly Ly - #s Pro Thr Thr Ile Asp     705                 7 - #10                 7 - #15                 7 -     #20     - Arg Thr Gly Thr His Lys Ser Val Ser Ser Al - #a Ile Leu Ser Lys Arg     #               735     - Pro Pro Glu Thr Thr Pro Lys Tyr Ser Leu Tr - #p Thr Leu Leu Lys Phe     #           750     - Val Ala Ser Phe Asn Arg Pro Glu Ile Pro Ty - #r Met Leu Ile Gly Leu     #       765     - Val Phe Ser Val Leu Ala Gly Gly Gly Gln Pr - #o Thr Gln Ala Val Leu     #   780     - Tyr Ala Lys Ala Ile Ser Thr Leu Ser Leu Pr - #o Glu Ser Gln Tyr Ser     785                 7 - #90                 7 - #95                 8 -     #00     - Lys Leu Arg His Asp Ala Asp Phe Trp Ser Le - #u Met Phe Phe Val Val     #               815     - Gly Ile Ile Gln Phe Ile Thr Gln Ser Thr As - #n Gly Ala Ala Phe Ala     #           830     - Val Cys Ser Glu Arg Leu Ile Arg Arg Ala Ar - #g Ser Thr Ala Phe Arg     #       845     - Thr Ile Leu Arg Gln Asp Ile Ala Phe Phe As - #p Lys Glu Glu Asn Ser     #   860     - Thr Gly Ala Leu Thr Ser Phe Leu Ser Thr Gl - #u Thr Lys His Leu Ser     865                 8 - #70                 8 - #75                 8 -     #80     - Gly Val Ser Gly Val Thr Leu Gly Thr Ile Le - #u Met Thr Ser Thr Thr     #               895     - Leu Gly Ala Ala Ile Ile Ile Ala Leu Ala Il - #e Gly Trp Lys Leu Ala     #           910     - Leu Val Cys Ile Ser Val Val Pro Val Leu Le - #u Ala Cys Gly Phe Tyr     #       925     - Arg Phe Tyr Met Leu Ala Gln Phe Gln Ser Ar - #g Ser Lys Leu Ala Tyr     #   940     - Glu Gly Ser Ala Asn Phe Ala Cys Glu Ala Th - #r Ser Ser Ile Arg Thr     945                 9 - #50                 9 - #55                 9 -     #60     - Val Ala Ser Leu Thr Arg Glu Arg Asp Val Tr - #p Glu Ile Tyr His Ala     #               975     - Gln Leu Asp Ala Gln Gly Arg Thr Ser Leu Il - #e Ser Val Leu Arg Ser     #           990     - Ser Leu Leu Tyr Ala Ser Ser Gln Ala Leu Va - #l Phe Phe Cys Val Ala     #      10050     - Leu Gly Phe Trp Tyr Gly Gly Thr Leu Leu Gl - #y His His Glu Tyr Asp     #  10205     - Ile Phe Arg Phe Phe Val Cys Phe Ser Glu Il - #e Leu Phe Gly Ala Gln     #               10401030 - #                1035     - Ser Ala Gly Thr Val Phe Ser Phe Ala Pro As - #p Met Gly Lys Ala Lys     #              10550     - Asn Ala Ala Ala Glu Phe Arg Arg Leu Phe As - #p Arg Lys Pro Gln Ile     #          10705     - Asp Asn Trp Ser Glu Glu Gly Glu Lys Leu Gl - #u Thr Val Glu Gly Glu     #      10850     - Ile Glu Phe Arg Asn Val His Phe Arg Tyr Pr - #o Thr Arg Pro Glu Gln     #  11005     - Pro Val Leu Arg Gly Leu Asp Leu Thr Val Ly - #s Pro Gly Gln Tyr Val     #               11201110 - #                1115     - Ala Leu Val Gly Pro Ser Gly Cys Gly Lys Se - #r Thr Thr Ile Ala Leu     #              11350     - Leu Glu Arg Phe Tyr Asp Ala Ile Ala Gly Se - #r Ile Leu Val Asp Gly     #          11505     - Lys Asp Ile Ser Lys Leu Asn Ile Asn Ser Ty - #r Arg Ser Phe Leu Ser     #      11650     - Leu Val Ser Gln Glu Pro Thr Leu Tyr Gln Gl - #y Thr Ile Lys Glu Asn     #  11805     - Ile Leu Leu Gly Ile Val Glu Asp Asp Val Pr - #o Glu Glu Phe Leu Ile     #               12001190 - #                1195     - Lys Ala Cys Lys Asp Ala Asn Ile Tyr Asp Ph - #e Ile Met Ser Leu Pro     #              12150     - Glu Gly Phe Asn Thr Val Val Gly Ser Lys Gl - #y Gly Met Leu Ser Gly     #          12305     - Gly Gln Lys Gln Arg Val Ala Ile Ala Arg Al - #a Leu Leu Arg Asp Pro     #      12450     - Lys Ile Leu Leu Leu Asp Glu Ala Thr Ser Al - #a Leu Asp Ser Glu Ser     #  12605     - Glu Lys Val Val Gln Ala Ala Leu Asp Ala Al - #a Ala Arg Gly Arg Thr     #               12801270 - #                1275     - Thr Ile Ala Val Ala His Arg Leu Ser Thr Il - #e Gln Lys Ala Asp Val     #              12950     - Ile Tyr Val Phe Asp Gln Gly Lys Ile Val Gl - #u Ser Gly Thr His Ser     #          13105     - Glu Leu Val Gln Lys Lys Gly Arg Tyr Tyr Gl - #u Leu Val Asn Leu Gln     #      13250     - Ser Leu Gly Lys Gly His         1330     - (2) INFORMATION FOR SEQ ID NO:3:     -      (i) SEQUENCE CHARACTERISTICS:     #pairs    (A) LENGTH: 4002 base               (B) TYPE: nucleic acid               (C) STRANDEDNESS: single               (D) TOPOLOGY: linear     -     (ii) MOLECULE TYPE: mRNA     -    (iii) HYPOTHETICAL: NO     -     (iv) ANTI-SENSE: NO     #ID NO:3: (xi) SEQUENCE DESCRIPTION: SEQ     - AUGUCCCCGC UAGAGACAAA UCCCCUUUCG CCAGAGACUG CUAUGCGCGA AC - #CUGCUGAG       60     - ACUUCAACGA CGGAGGAGCA AGCUUCUACA CCACACGCUG CGGACGAGAA GA - #AAAUCCUC      120     - AGCGACCUCU CGGCUCCAUC UAGUACUACA GCAACCCCCG CAGACAAGGA GC - #ACCGUCCU      180     - AAAUCGUCGU CCAGCAAUAA UGCGGUCUCG GUCAACGAAG UCGAUGCGCU UA - #UUGCGCAC      240     - CUGCCAGAAG ACGAGAGGCA GGUCUUGAAG ACGCAGCUGG AGGAGAUCAA AG - #UAAACAUC      300     - UCCUUCUUCG GUCUCUGGCG GUAUGCAACA AAGAUGGAUA UACUUAUCAU GG - #UAAUCAGU      360     - ACAAUCUGUG CCAUUGCUGC CGCGUCGACU UUCCAGAGGA UAAUGUUAUA UC - #AAAUCUCG      420     - UACGACGAGU UCUAUGAUGA AUUGACCAAG AACGUACUGU ACUUCGUAUA CC - #UCGGUAUC      480     - GGCGAGUUUG UCACUGUCUA UGUUAGUACU GUUGGCUUCA UCUAUACCGG AG - #AACACGCC      540     - ACGCAGAAGA UCCGCGAGUA UUACCUUGAG UCUAUCCUGC GCCAGAACAU UG - #GCUAUUUU      600     - GAUAAACUCG GUGCCGGGGA AGUGACCACC CGUAUAACAG CCGAUACAAA CC - #UUAUCCAG      660     - GAUGGCAUUU CGGAGAAGGU CGGUCUCACU UUGACUGCCC UGGCGACAUU CG - #UGACAGCA      720     - UUCAUUAUCG CCUACGUCAA AUACUGGAAG UUGGCUCUAA UUUGCAGCUC AA - #CAAUUGUG      780     - GCCCUCGUUC UCACCAUGGG CGGUGGUUCU CAGUUUAUCA UCAAGUACAG CA - #AAAAGUCG      840     - CUUGACAGCU ACGGUGCAGG CGGCACUGUU GCGGAAGAGG UCAUCAGCUC CA - #UCAGAAAU      900     - GCCACAGCGU UUGGCACCCA AGACAAGCUU GCGAAGCAGU AUGAGGUCCA CU - #UAGACGAA      960     - GCUGAGAAAU GGGGAACAAA GAACCAGAUU GUCAUGGGUU UCAUGAUUGG CG - #CCAUGUUU     1020     - GGCCUUAUGU ACUCGAACUA CGGUCUUGGC UUCUGGAUGG GUUCUCGUUU CC - #UGGUAGAU     1080     - GGUGCAGUCG AUGUGGGUGA UAUUCUCACA GUUCUCAUGG CCAUCUUGAU CG - #GAUCGUUC     1140     - UCCUUGGGGA ACGUUAGUCC AAAUGCUCAA GCAUUUACAA ACGCUGUGGC CG - #CGGCCGCA     1200     - AAGAUAUUUG GAACGAUCGA UCGCCAGUCC CCAUUAGAUC CAUAUUCGAA CG - #AAGGGAAG     1260     - ACGCUCGACC AUUUUGAGGG CCACAUUGAG UUACGCAAUG UCAAGCAUAU UU - #ACCCAUCU     1320     - AGACCCGAGG UCACCGUCAU GGAGGAUGUU UCUCUGUCAA UGCCCGCUGG AA - #AAACAACC     1380     - GCUUUAGUCG GCCCCUCUGG CUCUGGAAAA AGUACGGUGG UCGGCUUGGU UG - #AGCGAUUC     1440     - UACAUGCCUG UUCGCGGUAC GGUUUUGCUG GAUGGCCAUG ACAUCAAGGA CC - #UCAAUCUC     1500     - CGCUGGCUUC GCCAACAGAU CUCUUUGGUU AGCCAGGAGC CUGUUCUUUU UG - #GCACGACG     1560     - AUUUAUAAGA AUAUUAGGCA CGGUCUCAUC GGCACAAAGU ACGAGAAUGA AU - #CCGAGGAU     1620     - AAGGUCCGGG AACUCAUCGA GAACGCGGCA AAAAUGGCGA AUGCUCAUGA CU - #UUAUUACU     1680     - GCCUUGCCUG AAGGUUAUGA GACCAAUGUU GGGCAGCGUG GCUUUCUCCU UU - #CAGGUGGC     1740     - CAGAAACAGC GCAUUGCAAU CGCCCGUGCC GUUGUUAGUG ACCCAAAAAU CC - #UGCUCCUG     1800     - GAUGAAGCUA CUUCGGCCUU GGACACAAAA UCCGAAGGCG UGGUUCAAGC AG - #CUUUGGAG     1860     - AGGGCAGCUG AAGGCCGAAC UACUAUUGUG AUCGCUCAUC GCCUUUCCAC GA - #UCAAAACG     1920     - GCGCACAACA UUGUGGUUCU GGUCAAUGGC AAAAUUGCUG AACAAGGAAC UC - #ACGAUGAA     1980     - UUGGUUGACC GCGGAGGCGC UUAUCGCAAA CUUGUGGAGG CUCAACGUAU CA - #AUGAACAG     2040     - AAGGAAGCUG ACGCCUUGGA GGACGCCGAC GCUGAGGAUC UCACGAAUGC AG - #AUAUUGCC     2100     - AAAAUCAAAA CUGCGUCAAG CGCAUCAUCC GAUCUCGACG GAAAACCCAC AA - #CCAUUGAC     2160     - CGCACGGGCA CCCACAAGUC UGUUUCCAGC GCGAUUCUUU CUAAAAGACC CC - #CCGAAACA     2220     - ACUCCGAAAU ACUCAUUAUG GACGCUGCUC AAAUUUGUUG CUUCCUUCAA CC - #GCCCUGAA     2280     - AUCCCGUACA UGCUCAUCGG UCUUGUCUUC UCAGUGUUAG CUGGUGGUGG CC - #AACCCACG     2340     - CAAGCAGUGC UAUAUGCUAA AGCCAUCAGC ACACUCUCGC UCCCAGAAUC AC - #AAUAUAGC     2400     - AAGCUUCGAC AUGAUGCGGA UUUCUGGUCA UUGAUGUUCU UCGUGGUUGG UA - #UCAUUCAG     2460     - UUUAUCACGC AGUCAACCAA UGGUGCUGCA UUUGCCGUAU GCUCCGAGAG AC - #UUAUUCGU     2520     - CGCGCGAGAA GCACUGCCUU UCGGACGAUA CUCCGUCAAG ACAUUGCUUU CU - #UUGACAAG     2580     - GAAGAGAAUA GCACCGGCGC UCUGACCUCU UUCCUGUCCA CCGAGACGAA GC - #AUCUCUCC     2640     - GGUGUUAGCG GUGUGACUCU AGGCACGAUC UUGAUGACCU CCACGACCCU AG - #GAGCGGCU     2700     - AUCAUUAUUG CCCUGGCGAU UGGGUGGAAA UUGGCCUUAG UUUGUAUCUC GG - #UUGUGCCG     2760     - GUUCUCCUGG CAUGCGGUUU CUACCGAUUC UAUAUGCUAG CCCAGUUUCA AU - #CACGCUCC     2820     - AAGCUUGCUU AUGAGGGAUC UGCAAACUUU GCUUGCGAGG CUACAUCGUC UA - #UCCGCACA     2880     - GUUGCGUCAU UAACCCGGGA AAGGGAUGUC UGGGAGAUUU ACCAUGCCCA GC - #UUGACGCA     2940     - CAAGGCAGGA CCAGUCUAAU CUCUGUCUUG AGGUCAUCCC UGUUAUAUGC GU - #CGUCGCAG     3000     - GCACUUGUUU UCUUCUGCGU UGCGCUCGGG UUUUGGUACG GAGGGACACU UC - #UUGGUCAC     3060     - CACGAGUAUG ACAUUUUCCG CUUCUUUGUU UGUUUCUCCG AGAUUCUCUU UG - #GUGCUCAA     3120     - UCCGCGGGCA CCGUCUUUUC CUUUGCACCA GACAUGGGCA AGGCGAAGAA UG - #CGGCCGCC     3180     - GAAUUCCGAC GACUGUUCGA CCGAAAGCCA CAAAUUGAUA ACUGGUCUGA AG - #AGGGCGAG     3240     - AAGCUCGAAA CGGUGGAAGG UGAAAUCGAA UUUAGGAACG UGCACUUCAG AU - #ACCCGACC     3300     - CGCCCAGAAC AGCCUGUCCU GCGCGGCUUG GACCUGACCG UGAAGCCUGG AC - #AAUAUGUU     3360     - GCGCUUGUCG GACCCAGCGG UUGUGGCAAG AGUACCACCA UUGCAUUGCU UG - #AGCGCUUU     3420     - UACGAUGCGA UUGCCGGGUC CAUCCUUGUU GAUGGGAAGG ACAUAAGUAA AC - #UAAAUAUC     3480     - AACUCCUACC GCAGCUUUCU GUCACUGGUC AGCCAGGAGC CGACACUGUA CC - #AGGGCACC     3540     - AUCAAGGAAA ACAUCUUACU UGGUAUUGUC GAAGAUGACG UACCGGAAGA AU - #UCUUGAUU     3600     - AAGGCUUGCA AGGACGCUAA UAUCUACGAC UUCAUCAUGU CGCUCCCGGA GG - #GCUUUAAU     3660     - ACAGUUGUUG GCAGCAAGGG AGGCAUGUUG UCUGGCGGCC AAAAGCAACG UG - #UGGCCAUU     3720     - GCCCGAGCCC UUCUUCGGGA UCCCAAAAUC CUUCUUCUCG AUGAAGCGAC GU - #CAGCCCUC     3780     - GACUCCGAGU CAGAAAAGGU CGUCCAGGCG GCUUUGGAUG CCGCUGCCCG AG - #GCCGAACC     3840     - ACAAUCGCCG UUGCACACCG ACUCAGCACG AUUCAAAAGG CGGACGUUAU CU - #AUGUUUUC     3900     - GACCAAGGCA AGAUCGUCGA AAGCGGAACG CACAGCGAAC UGGUCCAGAA AA - #AGGGCCGG     3960     #4002              ACUU GCAGAGCUUG GGCAAGGGCC AU     - X-11766     - - 16     __________________________________________________________________________ 

We claim:
 1. A DNA compound that comprises an isolated DNA sequence that lacks an intron and encodes SEQ ID NO:2.
 2. The DNA compound of claim 1 which comprises the isolated DNA sequence which is SEQ ID NO:
 1. 3. A method for constructing a transformed host cell capable of expressing a protein comprising SEQ ID NO:2, said method comprising transforming a host cell with a recombinant DNA vector that comprises an isolated DNA sequence of claim
 1. 4. A method for expressing a protein comprising SEQ ID NO:2 in a transformed host cell said method comprising culturing said transformed host cell of claim 3 under conditions suitable for gene expression.
 5. A vector comprising an isolated DNA sequence of claim
 1. 6. A vector comprising an isolated DNA sequence of claim
 2. 7. A host cell containing the vector of claim
 5. 8. A host cell containing the vector of claim
 6. 